Design and Construction of Korean-Spoken English Corpus (K-SEC)
نویسندگان
چکیده
K-SEC(Korean-Spoken English Corpus) is a kind of speech database that is under construction by the authors of this paper. This article discusses the needs of the K-SEC from various academic disciplines and industrial circles, and it introduces the characteristics of the K-SEC design, its catalogues and contents of the recorded database, exemplifying what are being considered from both Korean and English languages' phonetics and phonologies. The KSEC can be marked as a beginning of a parallel speech corpus, and it is suggested that a similar corpus should be enlarged for the future advancements of the experimental phonetics and the speech information technology.
منابع مشابه
Design and construction of Korean-spoken English corpus
K-SEC(Korean-Spoken English Corpus) is a kind of speech database that is under construction by the authors of this paper. This article discusses the needs of the K-SEC from various academic disciplines and industrial circles, and it introduces the characteristics of the K-SEC design, its catalogues and contents of the recorded database, exemplifying what are being considered from both Korean an...
متن کاملKorean Children's Spoken English Corpus and an Analysis of its Pronunciation Variability
This paper introduces a corpus of Korean-accented English speech produced by children (the Korean Children’s Spoken English Corpus: the KC-SEC), which is constructed by Seoul National University. The KC-SEC was developed in support of research and development of CALL systems for Korean learners of English, especially for elementary school learners. It consists of read-speech produced by 96 Kore...
متن کاملGrammatical Error Annotation for Korean Learners of Spoken English
The goal of our research is to build a grammatical error-tagged corpus for Korean learners of Spoken English dubbed Postech Learner Corpus. We collected raw story-telling speech from Korean university students. Transcription and annotation using the Cambridge Learner Corpus tagset were performed by six Korean annotators fluent in English. For the annotation of the corpus, we developed an annota...
متن کاملLearning Uniication-based Grammars Using the Spoken English Corpus
This paper describes a grammar learning system that combines model-based and data-driven learning within a single framework. Our results from learning grammars using the Spoken English Corpus (SEC) suggest that combined model-based and data-driven learning can produce a more plausible grammar than is the case when using either learning style in isolation.
متن کاملLearning Unification-Based Grammars Using the Spoken English Corpus
This paper describes a grammar learning system that combines model-based and data-driven learning within a single framework. Our results from learning grammars using the Spoken English Corpus (SEC) suggest that combined model-based and data-driven learning can produce a more plausible grammar than is the case when using either learning style in isolation.
متن کامل